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1.1 Introduction 

Rapid advances in genetics, genomics, and imaging have given insight into the 
molecular and cellular basis of behaviour in a variety of model organisms with 
unprecedented detail and scope. It is increasingly routine to isolate behavioural 
mutants, clone and characterise mutant genes, and discern the molecular and 
neural basis for a behavioural phenotype. Conversely, reverse genetic approaches 
have made it possible to straightforwardly identify genes of interest in whole- 
genome sequences and generate mutants that can be subjected to phenotypic 
analysis. In this latter approach, it is the phenotyping that presents the major 
bottleneck; when it comes to connecting phenotype to genotype in freely behav- 
ing animals, analysis of behaviour itself remains superficial and time consuming. 
However, many proof-of-principle studies of automated behavioural analysis over 
the last decade have poised the field on the verge of exciting developments that 
promise to begin closing this gap. 

In the broadest sense, our goal in this chapter is to explore what we can learn 
about the genes involved in neural function by carefully observing behaviour. 
This approach is rooted in model organism genetics but shares ideas with ethol- 
ogy and neuroscience, as well as computer vision and bioinformatics. After in- 
troducing G. elegans as a model, we will survey the research that has led to the 
current state of the art in worm behavioural phenotyping and present current 
research that is transforming our approach to behavioural genetics. 

1.1.1 The worm as a model organism 

C. elegans is a nematode worm that lives in bacteria-rich environments such as 
rotting fruit and has also been isolated from insects and snails which it is thought 
to use for longer-range transportation (Barriere & Felix 2005) (Lee, Choi, Lee, 
Kim, Hwang, Kim, Park, Paik & Lee 2011). In the lab, it is commonly cultured 
on the surface of agar plates seeded with a lawn of the bacterium E. coli as a 
food source. On plates, worms lie on either their left or right side and crawl by 
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Figure 1.1 A) C. elegans is typically cultured on agar plates seeded with bacteria as a 
food source. Because of their small size, hundreds of worms can be grown on a 
standard 5 cm petri dish. B) Close-up of a worm crawling on an agar plate. C) When 
placed in a drop of liquid, C. elegans starts to swim. Swimming is characterised by 
more rapid body bending with a longer wavelength. 

propagating a sinuous dorso-ventral wave from head to tail. When immersed in 
a liquid, they can also swim (Fig. |1.1[ ) . 

As a model organism for the genetics of development, neuroscience, and be- 
haviour, C. elegans offers several advantages, ft progresses through its four lar- 
val stages in three and half days with each animal yielding about 300 progeny. 
Hermaphrodites can reproduce either by cross- or self-fertilisation; thus, mat- 
ing behaviour is not essential for the viability of mutant strains. Indeed, para- 
lyzed animals with almost no nervous system function can be propagated in the 
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laboratory, allowing the analysis of mutants defective in fundamental neuronal 
molecules such as synaptic vesicle proteins and voltage-gated channels. Both 
molccularly and anatomically C. elegans is exceptionally well-characterised. It 
was also the first multicellular organism to have its genome sequenced (elegans 
Sequencing Consortium 1998) and its well-annotated genome as well as infor- 
mation on mutants is available on the community website WormBase.org (Yook, 
Harris, Bieri, Cabunoc, Chan, Chen, Davis, de la Cruz, Duong, Fang, Ganc- 
san, Grove, Howe, Kadam, Kishore, Lee, Li, Muller, Nakamura, Nash, Ozersky, 
Paulini, Raciti, Rangarajan, Schindelman, Shi, Schwarz, Ann Tuli, Van Auken, 
Wang, Wang, Williams, Hodgkin, Berriman, Durbin, Kersey, Spieth, Stein & 
Sternberg 2012). It remains the only organism whose connectome (the wiring di- 
agram of all its 302 neurons) has been completely mapped at electron microscopic 
resolution (White, Southgatc, Thomson & Brenner 1986). The connectome and 
many other details of the worm's anatomy and physiology are available on Wor- 
mAtlas.org. Finally, the developmental lineage of each of its 959 adult cells is 
known and is highly repeatable from one individual to the next (Sulston 1977). 
In other words, given a cell in the adult, one can look up the complete list of 
divisions that led to that cell from the single-celled embryo. 

1.1.2 The Genetics of C. elegans 

The genetic study of C. elegans began in earnest with Brenner's 1974 paper in 
which he screened chemically mutagenised populations of worms for morpho- 
logical and locomotory defects (Brenner 1974). Males occur spontaneously at 
low rates (<~0.1%) and this rate can be increased using heat shock. These males 
can then be used in genetic crosses and to perform complementation tests and 
mapping. Using this approach, Brenner isolated numerous mutants with visible 
phenotypes, including mutants with movement defects comprising 77 comple- 
mentation groups. Since then, many more mutants have been identified, cloned, 
and often further characterised molecularly using variations on this approach. 

Forward genetic approaches continue to play an important role in uncovering 
the function of nervous system genes, but with the sequencing of the C. elegans 
genome in 1998 (elegans Sequencing Consortium 1998), reverse genetics is also 
being extensively used. 

Targeted gene deletion has been demonstrated in worms (Fr0kjaer- Jensen, 
Davis, Hollopetcr, Taylor, Harris, Nix, Lofgren, Prestgard-Duke, Bastiani, Mo- 
erman & Jorgensen 2010), but has not yet been widely adopted. In practice, 
reverse genetics in C. elegans is done principally in two ways: by screening li- 
braries of mutagenised genomes for those containing a deletion of interest and by 
feeding worms with bacteria that express an appropriate small interfering RNA. 

In the first approach, a population of worms is mutagenised using psoralen 
and UV irradiation which produces more small and medium deletions than the 
more standard mutagen EMS. The population is screened using PCR primers 
designed around the gene to be deleted until a match is found. Including library 
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construction, a strain carrying a desired deletion allele can be isolated in about 
two months (Ahringer 2006). The procedure has been standardized and is now 
performed by the C. elegans Gene Knockout Consortium as a service to the 
worm research community. Knockouts of new genes, or new alleles of previously 
knocked out genes, can be ordered online. Knock out and other strains can be 
ordered for a nominal fee from the C. elegans Genetics Centre at the University 
of Minnesota. 

RNA interference-based gene knockdown can be achieved in C. elegans sim- 
ply by feeding worms with bacteria that have been modified to express double 
stranded RNA complementary to the gene to be knocked down (Fraser, Kamath, 
Zipperlen, Martinez-Campos, Sohrmann & Ahringer 2000). This technique works 
because most nematode cells express the SID-1 transporter, which enables sys- 
temic uptake of double-stranded RNA triggers for RNAi. Libraries expressing 
dsRNA fragments of nearly all C. elegans open reading frames have been used 
to conduct near genome-wide knockdown screens for developmental and mor- 
phological phenotypes (Kamath, Fraser, Dong, Poulin, Durbin, Gotta, Kanapin, 
Le Bot, Moreno, Sohrmann, Welchman, Zipperlen & Ahringer 2003). Unfor- 
tunately, neurons do not express SID-1, which has limited the applicability of 
the original method to studying behaviour and nervous system function. Sev- 
eral approaches have been attempted to increase the efficiency of feeding RNAi 
with varying degrees of success (Simmer, Tijsterman, Parrish, Koushika, Nonet, 
Fire, Ahringer & Plasterk 2002) (Lehner, Calixto, Crombie, Tischler, Fortunato, 
Chalfie & Fraser 2006) (Kennedy, Wang & Ruvkun 2004). Recently, Calixto et 
al. demonstrated robust feeding RNAi using worms engineered to express SID-1 
transgenically in specific groups of neurons (Calixto, Chelur, Topalidou, Chen 
& Chalfie 2010). This exciting development may prove to be a key enabling 
technology for a genome-wide investigation of genes affecting behaviour. 

With these techniques, as well as the promise of population genomics, the tools 
for perturbation studies in C. elegans currently significantly outstrip behavioural 
phenotyping methods in both throughput and sophistication. However, this is 
changing, as we will describe after a brief introduction to some of what is already 
known about C. elegans behaviour. 

1.1.3 Classic approaches to behaviour 

Nematode behaviour has been studied by researchers for well over 50 years. Early 
work examined diverse species and their entire range of behaviour from escape 
from the egg, through molting, feeding, and mating. Several modes of locomotion 
were observed from crawling on solid surfaces to swimming and even jumping. 
Croll thoroughly reviewed this early work in Croll (1975a). For a more current 
survey of work on nematode behaviour, see Gaugler & Bilgrami (2004). 

Furthermore, much of this work was not simply descriptive. Quantitative anal- 
ysis and modeling was done using computer simulations of aspects of nema- 
tode exploration (Croll & Blair 1973), mechanical models of swimming motion 
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Figure 1.2 Worms leave tracks in an agar plate and this was used for track analysis. 
Adapted from Croll (19756). Pairs of tracks labeled A-D are taken from the same 
individual at different times to illustrate the observed variation. 



(Wallace 1968), and quantification of many worm locomotion features including 
wavelength, frequency and duration of reversals (Croll 19756). These are all still 
areas of active interest. Perhaps most relevant to our discussion here is the work 
on track analysis in C. elegans in which single or multiple worms are allowed to 
crawl on a fresh agar surface and inscribe a small crevice. At the end of the ob- 
servation time, the agar is transferred to a photographic enlarger and projected 
onto film. This provided a permanent record of the data and made subsequent 



analysis easier (Fig. 1.2). 

Track analysis gave quantitative insight into possible dispersal strategies and 
the behavioural mechanisms of chemotaxis. Interestingly, this method is amenable 
to high-throughput experiments with very low equipment cost, although as far 
as we know this has never been attempted. Many worms could be tracked si- 
multaneously, limited only by time to transfer single worms to plates and the 
analysis could be done on a single desktop computer. Nonetheless, track analysis 
does have its disadvantages. For example, worms can only be followed until they 
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reach the edge of their arena and there is no information about the dynamics of 
the motion other than the total time taken for the path. 



1.2 High Throughput Data Collection and Information Extraction 

There are two classes of problems that must be solved to bring behavioural phe- 
notyping to a level that is truly commensurate with that of current molecular 
tools. The first is collecting rich high throughput data and the second is ex- 
tracting meaning from this wealth of data in an unbiased and quantitative way. 
There are several important advantages to both of these aspects of automated 
behavioural fingerprinting. The first is simply that it is essential to collect data 
from a large number of strains to get a sense for the phenotypic landscape and 
to cover as many interesting genes as possible. The second is that a carefully 
engineered data collection pipeline will be naturally standardised and should be 
more reproducible over time, between operators, and between labs. Perhaps most 
importantly, quantitative feature extraction and data analysis lead to abstrac- 
tion. Once behavioural data have been abstracted, the precise source of the data 
matters less than its form, meaning that the extensive tools of statistics and 
especially bioinformatics can be brought to bear, potentially revealing subtle re- 
lationships between phenotypes that may reflect underlying genetic connections. 

As we will see in the next two sections many of these problems have been solved 
at the demonstration level, but the real power of this approach to behavioural 
genetics has yet to be realised. 

1.2.1 Worm Tracking: Throughput and Resolution 

Worm tracking simply refers to following a moving worm over time, typically 
recorded in video. The approaches taken so far in the field can be divided ac- 
cording to resolution: low-resolution tracking allows many worms to be captured 
in a single field of view (multi-worm tracking) but results in a less detailed pic- 
ture of each worm's behaviour (Fig. [L3K), while high-resolution single worm 
tracking provides more detail (Fig. |1.3f 3) but requires the system to follow a 
single worm in the camera's field of view for long enough to collect a reasonable 
amount of behaviour. 

The first automated worm tracker was described in 1985 in a remarkable pa- 
per by Dusenbery (Dusenbery 1985). Using computer and video equipment that 
are rudimentary by today's standards he was able to record the velocity and 
reversal rate of 25 worms at 1 Hz, updated in real time. He and collaborators 
then used the system to study C. elegans^ response to oxygen and carbon dioxide 
(Dunsenbery 1985) and to a variety of chemicals (Williams & Dusenbery 1990). 
Following this pioneering effort, there was a period with relatively little work on 
automated tracking even though there was rapid progress in behavioural genetics 
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Figure 1.3 A) Multiple worms are simultaneously tracked at low resolution using a 
multi worm tracker. The results shown here were generated using the Parallel Worm 
Tracker from the Goodman lab. B) A single worm can be tracked using a motorized 
stage to keep the worm in the camera's field of view. The worm's outline and skeleton 
are determined in each frame. By recording the distance travelled by the stage, a 
detailed track can be reconstructed including high-resolution posture information 
(outlines and skeletons from each frame are overlaid here). 



in C. elegans during the same period that could have benefited from the technol- 
ogy. This eventually changed as more groups discovered the utility of automated 
behavioural analysis and numerous groups used some kind tracking to investi- 
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gate worm locomotion, including speed to detect variability in wild isolates from 
different regions (de Bono & Bargmann 1998), reversals and turns to under- 
stand chemotaxis (Picrce-Shimomura, Morse & Lockery 1999), or simply using 
tracking to record long periods of behaviour for subsequent manual annotation 
(Waggoner, Zhou, Schafer & Schafer 1998). Since then there has been signifi- 
cant interest in developing more user friendly and/or standardized approaches 
to tracking and analysis both for multi-worm and single worm trackers. 

We are aware of three general-purpose multi-worm trackers designed for broad 
use. NeMo tracks worm locomotion and includes a graphical user interface that 
allows users to adjust tracking parameters and correct errors (Tsibidis & Tavernarakis 
2007). It also includes functionality for the subsequent analysis of behavioural 
data. The Parallel Worm Tracker uses a simple threshold to distinguish worms 
from background and works with bright field and dark field videos (Ramot, John- 
son, Berry, Carncll & Goodman 2008) . This system also includes a graphical user 
interface for rejecting or splitting bad tracks and outputs a variety of movement 
parameters for subsequent analysis. Because both groups make their source code 
(written in Matlab) freely available, users can add new locomotion metrics rel- 
atively straightforwardly. The most sophisticated multi-worm tracker to date, 
specifically designed for high throughput, uses Labview to capture data from a 4 
megapixel camera as well as to do the first stage of data processing in real time 
(Swierczek, Giles, Rankin & Kerr 2011). 

The first single worm trackers were introduced to allow higher resolution track- 
ing of an individual for long periods of time; these were used for subsequent 
manual analysis of egg-laying behaviour (Waggoner et al. 1998) or for automated 
analysis of location data (Pierce-Shimomura et al. 1999) (Hardaker, Singer, Kerr, 
Zhou & Schafer 2001). Because only a single worm is tracked at a time, this ap- 
proach is more time consuming than multi-worm tracking, but it provides a more 
nuanccd picture of behaviour and makes it possible to extract detailed features 
of worm shape and locomotion, which we will discuss in the next section. Single 
worm trackers typically consist of a motorized stage mounted on a dissecting mi- 
croscope with a camera. A computer records video data from the camera that is 
stored for later analysis but is also used to identify the worm and update the stage 
position in real time to keep the worm centered in the field of view. Although 
there are several labs using single-worm trackers (Baek, Cosman, Feng, Silver 
& Schafer 2002) (Stephens, Johnson-Kcrncr, Bialek & Ryu 2008) (Wang, Sun, 
Dixon, Alexander & Roy 2009) (Feng, Cronin, Wittig, Sternberg & Schafer 2004) 
(Cronin, Feng & Schafer 2006), they have not yet been widely adopted due in 
part to the expense of the systems and the expertise required to configure the 
hardware and write the tracking software. 

To address this, our group has developed a user-friendly single worm tracker 
built around a small inexpensive USB microscope. The entire system can be 
purchased for under $5000 USD and will soon be paired with a feature-rich 
analysis package. A parts list, instructions for set up, and the tracking software 
itself can be downloaded from the worm tracker website of the MRC Lab of 
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Figure 1.4 Schematic illustration of the current tradeoff between content and 
throughput in behavioural phenotyping. Integrating automated sample handling will 
significantly improve throughput for imaging systems that can run sufficiently 
autonomously because picking worms to plates is time consuming and labour 
intensive. Content can be significantly increased using better imaging methods, 
especially functional neural imaging. In the near future it should be possible to 
extend both content and throughput by adapting and integrating existing technology. 



Molecular Biology (http://www.mrc-lmb.cam.ac.uk/wormtracker/). Because the 
system is relatively inexpensive, we have been using it to run eight single worm 
trackers in parallel and thus increase the throughput of single worm tracking. 

This highlights the basic trade-off between single and multi-worm tracking — 
throughput versus resolution — and how these limitations are being addressed 
from both sides (Fig. |1.4| ). In the case of multi-worm tracking, higher resolution 
cameras are making it possible to extract skeleton-based features of worms even 
in a field of view with many individuals (Swierczek et al. 2011), while less ex- 
pensive hardware coupled with free software is making it possible to increase the 
throughput of the inherently high resolution single worm approach. Still, with 
current technology, multi-worm trackers can collect data from a larger number 
of worms more quickly and single worm trackers still have a resolution advantage 
so the two methods will likely co-exist for some time. 
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1.2.2 Data Reduction and Abstraction: Segment, Skeletonise, Featurise 

Some early applications of worm tracking were simply for data collection with 
the analysis still done manually from recorded video. To really take advantage of 
the wealth of data provided by trackers, it is necessary to develop algorithms to 
extract worms from background and to ultimately convert the data into a form 
that is intcrprctablc by humans to give insight and guide future experiments. 

Using appropriate lighting conditions (Yemini, Kerr & Schafer 2011a) it is 
usually possible to achieve sufficient contrast, even with worms on a bacterial 
food lawn, that a simple global threshold does a good job of identifying worms. 
For multi-worm trackers, this is sometimes coupled with a size range (Ramot 
et al. 2008) to eliminate large or small background features that are included by 
the threshold. For single worm tracking, taking the largest connected component 
in the thresholdcd image is often sufficient. Still, there are circumstances in which 
more robust approaches are helpful. For example, worms in more naturalistic 
environments, in microfiuidic devices, in thick food, or interacting with each 
other present challenges for the simplest thresholding approaches. 

In an effort to make a more universal system for worm identification, Sznit- 
man et al. have taken a multi-environment model estimation approach to worm 
segmentation (Sznitman, Gupta, Hager, Arratia & Sznitman 2010). In the first 
frame of the video only, users must identify the worm to train a Gaussian mix- 
ture model used to classify background and foreground (worm) pixels. This model 
is then used in subsequent frames to identify the worm which is extracted for 
further analysis. The same system was able to extract swimming and crawling 
worms and worms in microfiuidic devices (Sznitman, Gupta, Hager, Arratia & 
Sznitman 2010). 

In addition to the pixel intensities within a frame, there is also significant 
information in the correlation between frames. This has been exploited in worm 
segmentation using a Kalman filter (Fontaine, Burdick & Barr 2006) or recursive 
Bayesian filter (Roussel, Morton, Finger & Roysam 2007) coupled with a worm 
model. Both approaches allow for the separation of overlapping worms which 
could improve multi-worm trackers which often simply drop worms that are 
touching each other from the analysis (Ramot et al. 2008). It could also help 
track worms in, for example, a thick bacterial lawn that can sometimes obscure 
portions of the worm. Fontaine et al. illustrated their method by studying mating 
(Fontaine et al. 2006), a behaviour that naturally requires partially overlapping 
individuals! 

A series of outlines over time is still not particularly useful without further 
processing. For lower resolution multi-worm trackers, the next step in analysis 
is typically to quantify aspects of locomotion and posture based on the segmen- 
tation. For example, velocity is simply the change in centroid position of each 
blob over time. Reversals and turns can be identified as sharp angle changes in 
the worm's path. Worm posture can be roughly approximated by the eccentric- 
ity of the worm's equivalent ellipse. For higher resolution multi-worm and single 
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worm trackers, more features are readily calculated. In addition to morphology 
measures based on the outline, skeletonisation leads to a further data reduction 
without loss of relevant information because worms' width profiles are essentially 
constant over time. 

There are several algorithms that are used for skeletonisation. The first and 
still widely used computes the skeleton by thinning the thresholded worm to a 
single pixel and pruning branches based, for example, on taking the longest path 
through the structure. This works reasonably well, but because of the possible 
ambiguity of choosing the correct branch at the ends, it is not ideal for picking up 
more subtle head- foraging motions (Huang, Cosman & Schafer 2008). It is also 
possible to use a chamfer distance transform coupled with curvature to estimate 
a skeleton (Sznitman, Gupta, Hager, Arratia & Sznitman 2010). An attractive 
approach given its speed and simplicity has recently been reported based on 
finding the points of highest curvature on the outline and tracing the midline 
connecting these points (Leifer, Fang- Yen, Gcrshow, Alkema & Samuel 2011). 
Because the tip of the head is centered on a local curvature maximum, it does 
a good job of picking up subtle head motions. At the resolution of a typical 
single worm tracker, worm skeletons are on the order of 100 pixels long. This 
is almost certainly an over sampling given that C. elegans has 22 rows of body 
wall muscles and therefore significantly fewer degrees of freedom than 100, even 
including the head which is capable of more complicated motions. 

Skeleton curvature can be used to identify even subtle differences in body 
posture and changes in skeleton curvature over time should in principle be able 
to distinguish different kinds of uncoordination. Furthermore, looking at specific 
sequences of postures has allowed the detection of known behaviours including 
reversals and reorientations called omega turns (Huang, Cosman & Schafer 2006). 

A complementary approach to C. elegans behavioural representation has been 
described by Stephens et al. (Stephens et al. 2008). They segment and skeletonise 
the worm in each video frame, but instead of looking at the skeleton positions 
they analyse skeleton angles, rotated by the mean angle, yielding a position and 
orientation independent representation of body postures over time. They found 
that the covariance matrix of these angles has a relatively smooth structure 
and that just four eigenvectors can capture 95% of the shape variance of worms 
crawling off food. They call these four principal components eigenworms, which 
are essentially four basis shapes that can be added together to reconstruct worm 
postures. This compact representation led to interesting results on dynamical 
models of C. elegans and even to the emergence of stereotyped behaviour without 
the requirement of a central pattern generator (Stephens, Bueno de Mesquita, 
Ryu & Bialck 2011). More recently, we have shown that the eigenworms derived 
from wild-type animals can also be used to capture the postures of mutant worms, 
even those that are highly uncoordinated (Brown, Yemini, Grundy, Jucikas & 
Schafer 2012). This extends the applicability of the eigenworm representation 
and provides a common and compact basis for capturing worm postures. 
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1.2.3 



Towards an OpenWorm Analysis Platform 



Given the variety of systems that have been developed for the automated anal- 
ysis of C. elegans behaviour from videos, we argue that now is a reasonable 
time to consider coordinating the efforts of individual groups and creating an 
open platform for worm behaviour. In part this could consist simply of shared 
knowledge of protocols and hardware design. More importantly though, a set 
of standard functions available in an easy to use and extendable package would 
help lower the barrier to entry to new groups and focus researchers with interest 
and skills on developing useful new analysis tools rather than re-inventing the 
wheel at the start of each new project. In the ideal case, the project would look 
something like Image J, the open source image analysis package developed at NIH 
(Abramoff, Magalhaes & Ram 2004). It contains many core functions in an easy 
to use interface but most importantly, it allows the incorporation of plugins and 
has been adopted by an active group of users and developers. Of course, the 
possible user base for such a package will be much less than for Image J, but its 
focus will be correspondingly narrower and hopefully still manageable. 



Sometimes knocking out a gene results in little or no observable behavioural 
consequence. In these cases, careful quantification can confirm a phenotype sug- 
gested by human observation or even reveal completely new phenotypes. For ex- 
ample, worms lacking an ion channel called trpa-1 move well and seem healthy 
when observed under a dissecting scope (Kindt, Viswanath, Macpherson, Quast, 
Hu, Patapoutian & Schafer 2007). However, on closer inspection you might notice 
an abnormally large head swing (sometimes called foraging motion). By quanti- 
fying the rate of head swings a subtle but reproducible phenotype emerged and 
this defect in foraging helped direct studies that revealed new aspects of trpa-1 
function (Kindt et al. 2007). 

A related situation arose in the study of proprioception, or sense of body po- 
sition, in C. elegans. Mutant worms lacking an ion channel called unc-8 were 
previously reported to have no visible phenotype (Park & Horvitz 1986), but 
upon closer inspection were found to have a shallower body bend during loco- 
motion than wild-type worms. This visual impression was confirmed using track 
analysis (Tavernarakis, Shreffler, Wang & Driscoll 1997). Because of the neural 
circuits where unc-8 is expressed and its homology to MEC-4 channels involved 
in mcchanoscnsation, it was hypothesized to have a role in proprioception and the 
propagation of the travelling wave worms use for locomotion. Similarly, mutants 
lacking an ion channel called trp-4 show a posture defect but still move well. 
However, in contrast to unc-8 animals, trp~4 worms have deeper body bends 
than the wild-type (Li, Feng, Sternberg & Shawn Xu 2006). Several experiments 
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now strongly suggest that trp~4 encodes a mechanosensitive ion channel with a 
role in proprioception (Kang, Gao, Schafer, Xie & Xu 2010). 

There are two aspects of these studies that are particularly interesting. First, 
phenotypic quantification was used to confirm subtle phenotypes which then 
guided functional experiments. Second, although the phenotypes were subtle, 
they were picked up by human observers so it was clear what needed to be quan- 
tified. To see if there are other channels in the trp and deg channel families 
(unc-8 is a DEG/ENaC channel and trp~4 a TRP channel) that affect curva- 
ture, we tracked worms (Yemini, Kerr & Schafer 20116) with mutations in one or 
sometimes several TRP or DEG/ENaC channels. Several of these mutants were 
significantly more or less curved than wild-type, possibly suggesting more chan- 
nels with roles in proprioception (Fig. 1.5). Note that unc-8 and trp~4 are at the 
extremes of the curvature ranges, perhaps explaining why these channels were 
the ones initially picked up by human observation. Although the other curvature 
phenotypes have not been previously reported, quantitative analysis can reliably 
detect them and they may prove just as useful in guiding functional studies. 



1.3.2 Unbiased reverse genetic analysis of C. elegans behaviour 

As the curvature example suggests, even looking at a single feature can reveal 
previously unobserved phenotypes. What might we learn if we instead took an 
unbiased look at many mutants and many features? There have been several 
papers that demonstrate the potential of this approach. 

The first application of automated tracking and machine vision-based analysis 
to C. elegans was reported by Baek et al. in 2002 using single worm tracking 
and algorithms to extract morphological, locomotion, and posture data (Baek 
et al. 2002). Using video data recorded from 5 mutant strains, they built a classi- 
fication tree that could reliably distinguish the different types. Shortly thereafter, 
Geng et al. used the same system with 8 mutant strains to show that the data 
could not only be used to classify worms based on their genotypes, but also 
cluster them based on their phenotypic similarity (Geng, Cosman, Berry, Feng 
& Schafer 2004). Although related, clustering phenotypically similar strains is 
more relevant than accurately dividing a mixed population into genotype classes 
because a worm's genotype can be easily and accurately determined directly. Fur- 
thermore, although it may seem reasonable to assume that features with high 
classification accuracy will also be informative for phenotypic clustering, this is 
not necessarily the case and even for the small number of strains reported in 
these papers, somewhat different features were found to be important for the 
clustering compared to the classification tasks. 

Since then, there have been several papers that report automated algorithms to 
quantify the behaviour of C. elegans mutants. Without being exhaustive, we will 
highlight some notable examples. A similar approach, but with different, com- 
plementary features was described by Cronin et al. in 2006 (Cronin et al. 2006). 
They applied their system to a small number of mutants as a proof-of-concept 
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Figure 1.5 Some mutant strains are more or less curved than the N2 reference strain 
(wild type). Each point in the plot represents an individual worm tracked for 15 
minutes. In each frame the change in tangent angle <f> is averaged over the skeleton. 
These approximately 25 000 measurements are averaged to give each point shown in 
the figure. Black bars are means and error bars show standard errors. The black 
dashed line shows the N2 mean. Strains that are significantly more or less curved 
than N2 are indicated with the dashed boxes. 

and showed that they could detect differences in mutant sensitivities to chemi- 
cals including arsenite and aldicarb. More recently, Sznitman et al. (Sznitman, 
Purohit, Krajacic, Lamitina & Arratia 2010) and Krajacic et al. (Krajacic, Shen, 
Purohit, Arratia & Lamitina 2012) have taken a novel approach to feature ex- 
traction based on a detailed mechanical model of swimming C. elegans that 
takes into account both the worm body and its interaction with the surrounding 
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fluid. What makes their approach unique is that the features they extract — worm 
stiffness and tissue viscosity — have a direct physical interpretation that cannot 
be directly seen in the videos. They show that some mutants with defective 
muscle can be distinguished from wild-type and from each other based on these 
estimated parameters. 

Stephens et al. have used their eigenworm representation combined with stochas- 
tic modeling to derive unique metrics that summarise worm behaviour and can 
distinguish mutant from wild type worms (Stephens, Bueno de Mesquita, Ryu 
& Bialek 2011). As with the approach of Sznitman et al, these biophysics-based 
features may serve as useful complements to more empirical measures of be- 
haviour for genetics. They will be especially useful to the extent that they guide 
subsequent functional studies by relating a mutant phenotype to specific aspects 
of a worm's physiology or nervous system. 

Multi-worm trackers have also been used to quantify behaviours of mutants 
and to look at the effects of, for example, drugs (Ramot et al. 2008) and temper- 
ature (Biron, Shibuya, Gabel, Wasserman, Clark, Brown, Scngupta & Samuel 
2006). As in the case of single worm trackers, these have been used to answer 
specific phenotypic questions or serve as proof-of-concept experiments on rela- 
tively small numbers of mutants. Swierczek et al. studied the response of popu- 
lations of worms to repeated plate tap to find mutants with altered habituation 
(Swierczek et al. 2011). They also showed the applicability of their system for 
studies of chemotaxis. They found some interesting hits in their sample of 33 
mutant strains and this will hopefully lead to a larger-scale study. 

What all of these studies have in common is their use of human-defined fea- 
tures to represent phenotypes. However, we do not know if these features are 
optimal for comparing phenotypes nor whether they are useful for discovering 
new behaviours. We have therefore also taken a complementary route, using 
an unsupervised search for behavioural motifs — short subsequences of repeated 
behaviour — to define phenotypes (Brown et al. 2012). We projected worm skele- 
tons onto wild-type-derived eigenworms and searched for closely repeated sub- 
sequences of different lengths and repeated this for many individuals in a large 
behavioural database. Some of the motifs represent subtle or irregular behaviours 
that are nonetheless closely repeated at two different times. We combined the 
motifs into a dictionary and each video was then compared to the each of the 
elements of the dictionary to make a phenotypic fingerprint for that individual. 
This is analogous to using features as described above, but now each "feature" 
is a distance from an automatically identified behavioural motif from the dictio- 
nary. Phenotypic comparisons based on the motif dictionary recapitulated some 
known genetic relationships and were used to hypothesise connections between 
previously uncharacterised genes (Brown et al. 2012). 
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1.4 Outlook 

1.4.1 Increasing throughput and discrimination 

It is clear from the previous sections that the core technologies for high-throughput 
quantitative phenotyping of C. elegans are within reach, but there remain several 
challenges that must be overcome. A framework for understanding the current 
state of the art and directions for future improvement is summarised in Fig. |1.4| 
In essence, it is difficult to achieve both high content and high throughput in 
a single system. A multi-worm tracker can collect video data from enough indi- 
viduals to distinguish at least some phenotypes in about 1 hour. This number 
could be decreased with cameras with more pixels and faster computers, bring- 
ing, for example, a genome-wide screen within reach. For groups interested in a 
particular uncommon phenotype, this could be sufficient. 

For example, one of the main phenotypes described by Swierczek et al. was tap 
habituation. This is the rate at which worms become insensitive to mechanical 
stimulation provided by repeated taps to their plate. If there are a relatively 
small number of genes involved, then a large-scale screen could reveal a tractable 
number of candidate mutants that can be followed up for further functional 
analysis. If, on the other hand, one is interested in proprioception, a genome- 
wide search for mutants with a curvature phenotype is likely to yield a large 
number of hits because there are many ways of disrupting neural connections or 
signaling pathways that lead to an increase or decrease in curvature. In this case it 
would be desirable to have a more detailed behavioural fingerprint for clustering 
worms into related phenotypic classes that may be functionally related as has 
been demonstrated for a small number of mutants. Then it would be possible to 
target follow-up experiments to particular phenotypic sub-classes of particular 
interest. 

This consideration of phenotypic content may be especially critical for future 
large-scale behavioural screens. Multi-worm trackers provide greater throughput, 
but if they lack the sensitivity to meaningfully distinguish or cluster hundreds 
or thousands of mutant strains this throughput may be difficult to put to good 
use. Analysis at this scale has not yet been attempted for worm behaviour so 
this remains an open question. The development of high-resolution multi-worm 
trackers may help (Swierczek et al. 2011). 

1.4.2 Phenotyping by imaging neuromuscular activity 

Behaviour is the result of a potentially complex feedback between sensation, neu- 
romuscular activity, anatomy, and the physical properties of the environment. 
Isolating the contributions from each of these factors is a challenge, but one area 
where there are likely to be rapid advances is in linking the activity in specific 
neural circuits and muscles with behaviour and ultimately with genetics. The 
main technology driving this advance is the optical recording of neural activ- 
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ity with genetically encoded reporters in freely behaving animals. Fluorescence 
imaging is relatively non-invasive and is perfectly suited to recording from mul- 
tiple cells simultaneously. Moreover, nematodes are largely transparent, making 
their neurons easily accessible to optical recording. Indeed, the use of genetically- 
encoded sensors to record neural activity was first demonstrated in C. elegans 
(Kerr, Lev-Ram, Baird, Vincent, Tsien & Schafcr 2000). 

Currently the most widely used probes are genetically-encoded calcium in- 
dicators (GECIs) that change their emission in response to changes in Ca + 
concentration. This can be either through a change in Forster resonance energy 
transfer (FRET) efficiency in the case of cameleons (Miyawaki, Llopis, Heim, 
McCaffcry, Adams, Ikura & Tsien 1997) or intensity in the case of G-CaMP 
(Nakai, Ohkura & Imoto 2001) and related sensors. There has also been some 
recent notable success in genetically encoded sensors designed to directly sense 
voltage across cell membranes rather than a second messenger like Ca 2+ (Kralj , 
Douglass, Hochbaum, Maclaurin & Cohen 2011). Voltage sensors are not lim- 
ited by the timescale of cellular Ca 2+ dynamics and do not rely on the proper 
functioning of endogenous voltage-gated Ca 2+ channels. Despite their promise, 
genetically encoded voltage sensors have not yet been developed to the same 
extent as Ca 2+ sensors and are not yet widely used in intact animals. 

In worms, GECIs have been used to record the activities of a wide range 
of neurons. In immobilised animals, the responses of individual sensory neu- 
rons to mechanical (Suzuki, Kerr, Bianchi, Fr0kjaer- Jensen, Slone, Xue, Ger- 
stbrein, Driscoll & Schafer 2003), proprioceptive (Li et al. 2006), and chemi- 
cal (Hilliard, Apicella, Kerr, Suzuki, Bazzicalupo & Schafer 2005) (Chalasani, 
Chronis, Tsunozaki, Gray, Ramot, Goodman & Bargmann 2007) and thermal 
(Biron et al. 2006) (Kimura, Miyawaki, Matsumoto & Mori 2004) stimuli has 
helped dissect the role of specific neurons in neural circuits (Suzuki, Thiele, Fau- 
mont, Ezcurra, Lockery & Schafer 2008) (Macosko, Pokala, Feinberg, Chalasani, 
Butcher, Clardy & Bargmann 2009). It is this ability to correlate a macroscopic 
behavioural response with the underlying neural circuits that makes neural imag- 
ing such an exciting complement to whole-animal behavioural quantification. 
Demonstrations of neural imaging in unconstrained worms are now emerging. 
Haspel et al. examined the correlation between motor neuron activity and loco- 
motion but were only able to resolve broad activity differences between forward 
and backward locomotion (Haspel, O'Donovan & Hart 2010). This work was 
extended by Kawano et al. who found that an imbalance in motor neuron ac- 
tivity governs the forward or backward state and identified gap junction genes 
involved in regulating these states (Kawano, Po, Gao, Leung, Ryu & Zhen 2011). 
In another study, Faumont et al. showed very fast tracking and high-resolution 
functional imaging using a quadrant photodiode (rather than image) based track- 
ing system (Faumont, Rondeau, Thiele, Lawton, McCormick, Sottile, Griesbeck, 
Heckscher, Roberts, Doe & Lockery 2011). 

The combination of neural imaging and worm tracking is in a state analo- 
gous to worm tracking itself a decade ago: there have been some interesting 



18 



Andre E.X. Brown and William R. Schafer 



demonstrations that have focused on proving feasibility and answering specific 
questions about locomotion or the action of specific genes. Genetically encoded 
Ca 2+ and voltage indicators have ever-improving sensitivity response time, and 
signal to noise ratio (Looger & Griesbeck 2012). At the same time, sensitive cam- 
eras are getting faster and cheaper. An inexpensive general-purpose system for 
both tracking and neural imaging would vastly increase the scope of behavioural 
phenotyping in worms in part because mutations that lead to subtle defects 
in particular neural circuits may be masked by compensation in another circuit. 
This may be a factor that limits the discriminative power in genome- wide screens 
for behavioural mutants. Even a coarse-grained view of neural circuit dynamics 
may offer significant improvements in discriminative power (Fig. 1.4 1. 

Any attempt to integrate neural activity into automated behavioural genetics 
will face an explosion of data, not just in volume, but also dimension. Fortunately, 
with appropriate analysis it should still be possible to devise an abstraction that 
makes the data amenable to existing (and future) methods for high-dimensional 
data. A critical challenge will be to search for a low dimensional representation 
that still captures important variation as was done for C. elegans locomotion 
(Stephens et al. 2008). The generality of this approach (Stephens, Osborne & 
Bialek 2011) bodes well for applications in neural imaging, especially in a rela- 
tively tractable nervous system like C. elegans\ 



1.4.3 Beyond spontaneous behaviour 

C. elegans^ relative simplicity has led to speculation that it may be possible to 
quantitatively describe its entire behavioural repertoire (Stephens et al. 2008). 
This may be possible in the near future for spontaneous locomotion on a sur- 
face, but worms are capable of much more. Exactly how much more is not yet 
known quantitatively, but finding out will require an extension of behavioural 
observation beyond spontaneous behaviour on a featureless agar plate. 

C. elegans responds to a wide range of stimuli. For example, when gently 
touched on their anterior half, worms will reverse. When touched on their poste- 
rior half, they accelerate forward and these responses are controlled by specific 
neural circuits (Chalfie, Sulston, White, Southgate, Thomson & Brenner 1985). 
Worms also sense soluble and volatile chemical cues including salt, cAMP, bi- 
otin, carbon dioxide, and oxygen (Bargmann 2006). Worms actively chemotax 
up attractant gradients and display escape behaviours around noxious chemicals. 
Worms also respond to temperature and thermotax toward their most recent cul- 
tivation temperature. They will even accurately track isotherms when placed in 
a temperature gradient (Luo, Clark, Biron, Mahadevan & Samuel 2006). These 
behaviours (and the associated genes and neural circuits) have primarily been 
discovered using manual assays and expert observation, but new technologies 
are constantly being developed to increase sensitivity and reproducibility and to 
decrease the subjectivity that can creep into manually scored experiments. Here 
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we review a selection of promising directions to give a sense for what is currently 
possible and where it might lead. 

If a worm crawling on an agar surface is suddenly placed in a drop of liq- 
uid, it will rapidly change to a swimming gait characterised by a higher fre- 
quency of body bending and a longer body wavelength (Fig. |1.1[ ). This repre- 
sents quite an extreme change in the worm's mechanical environment and its 
response seems to be under genetic control (Vidal-Gadea, Topper, Young, Crisp, 
Kressin, Elbel, Maples, Brauner, Erbguth, Axelrod, Gottschalk, Siegel & Pierce- 
Shimomura 2011). More subtle transitions are also possible and these have been 
observed by changing viscosity (Korta, Clark, Gabel, Mahadevan & Samuel 2007) 
(Bcrri, Boyle, Tassieri, Hope & Cohen 2009) and more recently with compressive 
force (Lebois, Sauvage, Py, Cardoso, Ladoux, Hersen & DiMeglio 2012). The ad- 
vantage of the latter approach is both that the force can easily be changed in 
real-time and also that there is less chance of inhomogcneity, which can compli- 
cate interpretation in studies of swimming in viscous fluids (Boyle, Berri, Tassieri, 
Hope & Cohen 2011). What makes these studies so interesting is that an accu- 
rate response to viscosity will likely require an integration of senses, for example 
a sense of the applied force of muscles and of touch. Another possibility would 
be a sense of applied force combined with sense of posture over time. The role of 
touch in gait adaptation has been established (Korta et al. 2007), but proprio- 
ception has not yet been directly implicated and it is not known whether worms 
have a direct sense of effort. Further tracking experiments combined with ge- 
netic and neural ablations as well as careful theoretical work (Fang- Yen, Wyart, 
Xie, Kawai, Kodger, Chen, Wen & Samuel 2010) (Sznitman, Shen, Purohit & 
Arratia 2010) (Boyle, Bryden & Cohen n.d.) (Sauvage, Argentina, Drappier, 
Senden, Simon & Di Meglio 2011) (Mailler, Avery, Graves & Willy 2010) will be 
required to resolve this case of sensory integration and gait computation. 

Microfluidics has opened many opportunities for precisely handling and im- 
mobilizing worms and for providing specific stimulations. Two-layer devices with 
pneumatic valves can be used for directing worms to particular chip locations 
where they can be clamped and assayed (Fig. |1.6| . This has been used for stud- 
ies of laser axotomy and axon regeneration (Yanik, Cinar, Cinar, Chisholm, 
Jin & Ben-Yakar 2004), automated (Chung & Lu 2009) and semi-automated 
(Hulme, Shevkoplyas, Apfeld, Fontana & Whitesides 2007) neuron ablation, 
and for neural imaging of responses to soluble chemicals (Chronis, Zimmer & 
Bargmann 2007) (Chokshi, Bazopoulou & Chronis 2010). See Chronis (2010) for 
a review of some of the many related applications. Of particular interest for ex- 
tending behavioural phenotyping are applications that control the environment 
while allowing relatively free locomotion. Extending the concept of articifcial 
dirt (Lockery, Lawton, Doll, Faumont, Coulthard, Thiele, Chronis, McCormick, 
Goodman & Pruitt 2008), Albrecht et al. have taken advantage of the laminar 
flow that exists at low Reynolds number in microfluidic devices to apply precise 
gradients and sharp boundaries of soluble chemicals to study chemosensation and 
chemotaxis quantitatively in a potentially high-throughput manner (Albrecht & 
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Figure 1.6 A) Schematic drawing of a pneumatic worm clamping device. B) 
Fluorescent image of a clamped worm. A and B are adapted from Yanik et al. (2004). 
C) Microfluidic device for presenting soluble chemicals in sharp steps to worms. D) 
Close-up view of a worm in the device shown in C. C and D are adapted from 
Albrecht & Bargmann (2011) 



Bargmann 2011). Another particularly simple and provocative experiment sug- 
gested that worms could remember the solution to a simple microfluidic T-maze 
(Qin & Wheeler 2007). 

Another method of applying controlled simuli is provided by optogenetics 
(Fenno, Yizhar & Deisseroth 2011). The basic approach is to express a light gated 
ion channel using promoters specific to a subset of neurons. Then, when these 
channels are activated by light they either depolarize and excite or hyperpolarize 
and inhibit the neurons where they are expressed. By controlling expression, a 
simple apparatus can be used to activate or inhibit specific neurons and mon- 
itor the effect on behaviour (Nagel, Brauner, Liewald, Adeishvili, Bamberg &; 
Gottschalk 2005). However, because promoters are often not available that are 
specific to single neurons, it is desirable to be able to target a particular neu- 
ron from a population that expresses the light-sensitive channel. This has the 
added advantage that different neurons can be targeted in the same animal in 
a single experiment. Following from earlier work that combined activation and 
imaging in immobilized worms (Guo, Hart & Ramanathan 2009), two groups 
simultaneously published reports describing the activation of specific neurons in 
freely behaving animals using a motorized stage and a digital micromirror de- 
vice (Leifer et al. 2011) or a slightly modified projector (Stirman, Crane, Husson, 
Wabnig, Schultheis, Gottschalk & Lu 2011) for local illumination. The obvious 
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application of these systems is for investigating the role of specific neurons and 
neural circuits in behaviour, but there will also be applications in genetics. In 
particular, even for mutants with no obvious spontaneous locomotion defects it 
may be possible to detect changes in responses to neuronal stimulation that may 
reflect more subtle defects in neural circuit function. 

1.5 Conclusions 

The promise of automated behavioural phcnotyping in C. elegans has long been 
recognized and will soon be realized on a large scale. Quantitative data enables 
more meaningful comparisons with models of locomotion that are beginning to 
emerge and can in some cases lead to new insights into the diversity of — as well 
as constraints on — behaviour. The better we understand behaviour the more 
able we will be to design useful behavioural metrics and to interpret mutant 
phenotypes. Already, new algorithms and approaches are showing potential for 
applications in genetics and with new inexpensive high-throughput methods, 
open and useable software, and new bioinformatic approaches, these will be more 
widely adopted and applied to a broad range of topics. 

The next decade promises substantial advances in our understand of behaviour 
and the connection between genotype and this fascinating range of phenotypes. 
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